Implementation of the DWT using Intel IA-32 SIMD Extensions

Authors

  • ENRIQUE P. MARTIN
  • JOSE SALVADO
Abstract

This paper presents results on implementing the discrete wavelet transform (DWT) through the lifting scheme using the SIMD extensions of general-purpose processors. We perform image analysis and reconstruction with up to three levels of decomposition, using the DWT factored into lifting steps for the 9/7 wavelet filter pair. The algorithm was implemented in C and evaluated in terms of performance and image degradation. Three approaches were compared: floating-point representation, integer fixed-point representation, and integer code using SIMD extensions. Relative to the floating-point implementation, the fixed-point code takes around 54% of the processing time and the SIMD extensions code around 24.2%. The average PSNR results are also better for the fixed-point and SIMD extensions code than for the floating-point implementation.

Key-Words: DWT, Lifting Scheme, SIMD Extension, Intel IA-32 GPP Architecture
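As an illustration of what "the DWT factored into lifting steps for the 9/7 wavelet filter pair" can look like, the following is a minimal floating-point sketch of one 1-D decomposition level and its inverse. It is not the authors' code. The function names (`dwt97_forward`, `dwt97_inverse`), the in-place interleaved layout, the symmetric border handling, and the scaling convention are assumptions made for this example, and the coefficient values are the ones commonly quoted for the 9/7 lifting factorization rather than values taken from the abstract.

```c
#include <assert.h>
#include <stddef.h>

/* Commonly quoted 9/7 lifting coefficients (assumed; not listed in the abstract). */
#define LIFT_ALPHA (-1.586134342)
#define LIFT_BETA  (-0.05298011854)
#define LIFT_GAMMA ( 0.8829110762)
#define LIFT_DELTA ( 0.4435068522)
#define LIFT_K     ( 1.149604398)

/* Add c * (left + right neighbour) to every odd-indexed sample. */
static void lift_odd(double *x, size_t n, double c)
{
    for (size_t i = 1; i + 1 < n; i += 2)
        x[i] += c * (x[i - 1] + x[i + 1]);
    x[n - 1] += 2.0 * c * x[n - 2];      /* mirrored border: x[n] == x[n-2] */
}

/* Add c * (left + right neighbour) to every even-indexed sample. */
static void lift_even(double *x, size_t n, double c)
{
    x[0] += 2.0 * c * x[1];              /* mirrored border: x[-1] == x[1] */
    for (size_t i = 2; i < n; i += 2)
        x[i] += c * (x[i - 1] + x[i + 1]);
}

/* One forward level, in place. Afterwards even indices hold the low-pass
   (approximation) samples and odd indices the high-pass (detail) samples. */
void dwt97_forward(double *x, size_t n)
{
    assert(n >= 2 && n % 2 == 0);        /* sketch handles even lengths only */
    lift_odd(x, n, LIFT_ALPHA);          /* predict 1 */
    lift_even(x, n, LIFT_BETA);          /* update 1  */
    lift_odd(x, n, LIFT_GAMMA);          /* predict 2 */
    lift_even(x, n, LIFT_DELTA);         /* update 2  */
    for (size_t i = 0; i < n; ++i)       /* scaling (one of several conventions) */
        x[i] = (i % 2) ? x[i] / LIFT_K : x[i] * LIFT_K;
}

/* Inverse level: undo the scaling, then the lifting steps in reverse order
   with negated coefficients. */
void dwt97_inverse(double *x, size_t n)
{
    assert(n >= 2 && n % 2 == 0);
    for (size_t i = 0; i < n; ++i)
        x[i] = (i % 2) ? x[i] * LIFT_K : x[i] / LIFT_K;
    lift_even(x, n, -LIFT_DELTA);
    lift_odd(x, n, -LIFT_GAMMA);
    lift_even(x, n, -LIFT_BETA);
    lift_odd(x, n, -LIFT_ALPHA);
}
```

Because each lifting step changes samples of one parity using only samples of the other, the inverse reproduces the input up to floating-point rounding. A fixed-point or SIMD variant of the kind the abstract compares would keep the same step structure but replace the `double` arithmetic with scaled integer operations; that part is not sketched here.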


Similar resources

A Portable Short Vector Version of FFTW

This paper presents a portable short vector extension for the popular FFT library FFTW. FFTW is a freely available, portable FFT software library that achieves top performance across a large number of platforms. The newly developed extension enables the use of short vector extensions such as Intel's SSE and SSE2 as well as Motorola's AltiVec for any problem size. The method is independent...


IA-32 Execution Layer: a two-phase dynamic translator designed to support IA-32 applications on Itanium-based systems

IA-32 Execution Layer (IA-32 EL) is a new technology that executes IA-32 applications on Intel® Itanium® processor family systems. Currently, support for IA-32 applications on Itanium-based platforms is achieved using hardware circuitry on the Itanium processors. This capability will be enhanced with IA-32 EL—software that will ship with Itanium-based operating systems and will convert IA-32 in...


Pentium III Processor Implementation Tradeoffs

This paper discusses the implementation tradeoffs of the Pentium III processor. The Pentium III processor implements a new extension of the IA-32 instruction set called the Internet Streaming Single-Instruction, Multiple-Data (SIMD) Extensions (Internet SSE). The processor is based on the Pentium Pro processor microarchitecture. The initial development goals for the Pentium III processor were ...


Vectorization of the 2D Wavelet Lifting Transform Using SIMD Extensions

This paper addresses the vectorization of the lifting-based wavelet transform on general-purpose microprocessors in the context of JPEG2000. Since SIMD exploitation strongly depends on an efficient memory hierarchy usage, this research is based on previous work about cache-conscious DWT implementations [1,2,3]. The experimental platform on which we have chosen to study the benefits of the SIMD e...


An Implementation of Parallel 1-D FFT Using SSE3 Instructions on Dual-Core Processors

In the present paper, an implementation of a parallel one-dimensional fast Fourier transform (FFT) using Streaming SIMD Extensions 3 (SSE3) instructions on dual-core processors is proposed. Combination of vectorization and the block six-step FFT algorithm is shown to effectively improve performance. The performance results for one-dimensional FFTs on dual-core Intel Xeon processors are reported...




Publication date: 2008